Robust SVD Method for Missing Value Estimation of DNA Microarrays
نویسندگان
چکیده
A majority of DNA microarray datasets contain missing or corrupt values and it is critical to estimate these values accurately. These missing values are most often attributed to insufficient experimental resolution or the presence of foreign objects on the experimental slide’s surface. To improve existing missing value estimation algorithms, this paper introduces and investigates the scalable singular value decomposition (SSVD) solver, which is an improvement upon the Jacobi singular value decomposition (SVD) approach. Experiments were conducted on a study comparing SSVD to the Jacobi and QR SVD methods against several legitimate microarray datasets. The robustness of SSVD is verified by subjecting it to several test cases containing 1–20% of missing values. For nearly all of the test cases across all configurations of missing value percentages, SSVD provides more accurate recovery results than Jacobi and SQ SVD. These numerical results strongly suggest SSVD is a robust and scalable solver.
منابع مشابه
Missing value estimation methods for DNA microarrays
MOTIVATION Gene expression microarray experiments can generate data sets with multiple missing expression values. Unfortunately, many algorithms for gene expression analysis require a complete matrix of gene array values as input. For example, methods such as hierarchical clustering and K-means clustering are not robust to missing data, and may lose effectiveness even with a few missing values....
متن کاملCollateral Missing Value Estimation: Robust Missing Value Estimation for Consequent Microarray Data Processing
Microarrays have unique ability to probe thousands of genes at a time that makes it a useful tool for variety of applications, ranging from diagnosis to drug discovery. However, data generated by microarrays often contains multiple missing gene expressions that affect the subsequent analysis, as most of the times these missing values are ignored. In this paper we have analyzed how accurate esti...
متن کاملAdjustable Robust Singular Value Decomposition: Design, Analysis and Application to Finance
The Singular Value Decomposition (SVD) is a fundamental algorithm used to understand the structure of data by providing insight into the relationship between the row and column factors. SVD aims to approximate a rectangular data matrix, given some rank restriction, especially lower rank approximation. In practical data analysis, however, outliers and missing values maybe exist that restrict the...
متن کاملHeuristic Non Parametric Collateral Missing Value Imputation: A Step Towards Robust Post-genomic Knowledge Discovery
Microarrays are able to measure the patterns of expression of thousands of genes in a genome to give profiles that facilitate much faster analysis of biological processes for diagnosis, prognosis and tailored drug discovery. Microarrays, however, commonly have missing values which can result in erroneous downstream analysis. To impute these missing values, various algorithms have been proposed ...
متن کاملA Simultaneous Reconstruction of Missing Data in DNA Microarrays
We suggest here a new method of the estimation of missing entries in a gene expression matrix, which is done simultaneously— i.e., the estimation of one missing entry influences the estimation of other entries. Our method is closely related to the methods and techniques used for solving inverse eigenvalue problems. 2000 Mathematical Subject Classification: 15A18, 92D10
متن کامل